首页> 外文OA文献 >A Transprecision Floating-Point Platform for Ultra-Low Power Computing
【2h】

A Transprecision Floating-Point Platform for Ultra-Low Power Computing

机译:用于超低功耗计算的Transprecision浮点平台

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In modern low-power embedded platforms, floating-point (FP) operations emergeas a major contributor to the energy consumption of compute-intensiveapplications with large dynamic range. Experimental evidence shows that 50% ofthe energy consumed by a core and its data memory is related to FPcomputations. The adoption of FP formats requiring a lower number of bits is aninteresting opportunity to reduce energy consumption, since it allows tosimplify the arithmetic circuitry and to reduce the memory bandwidth betweenmemory and registers by enabling vectorization. From a theoretical point ofview, the adoption of multiple FP types perfectly fits with the principle oftransprecision computing, allowing fine-grained control of approximation whilemeeting specified constraints on the precision of final results. In this paperwe propose an extended FP type system with complete hardware support to enabletransprecision computing on low-power embedded processors, including twostandard formats (binary32 and binary16) and two new formats (binary8 andbinary16alt). First, we introduce a software library that enables explorationof FP types by tuning both precision and dynamic range of program variables.Then, we present a methodology to integrate our library with an external toolfor precision tuning, and experimental results that highlight the clearbenefits of introducing the new formats. Finally, we present the design of atransprecision FP unit capable of handling 8-bit and 16-bit operations inaddition to standard 32-bit operations. Experimental results on FP-intensivebenchmarks show that up to 90% of FP operations can be safely scaled down to8-bit or 16-bit formats. Thanks to precision tuning and vectorization,execution time is decreased by 12% and memory accesses are reduced by 27% onaverage, leading to a reduction of energy consumption up to 30%.
机译:在现代低功耗嵌入式平台中,浮点(FP)操作成为动态范围较大的计算密集型应用程序能耗的主要贡献者。实验证据表明,内核及其数据存储器消耗的能量的50%与FP计算有关。要求较少位数的FP格式的采用是减少能耗的有趣机会,因为它可以简化算术电路并通过实现矢量化来减少内存和寄存器之间的存储带宽。从理论上讲,采用多种FP类型完全符合超精密计算的原理,可以对近似值进行细粒度的控制,同时满足最终结果精度的特定限制。在本文中,我们提出了一个扩展的FP类型系统,该系统具有完整的硬件支持,可在低功耗嵌入式处理器上实现高精度计算,包括两种标准格式(binary32和binary16)和两种新格式(binary8和binary16alt)。首先,我们介绍一个软件库,该软件库可以通过调整程序变量的精度和动态范围来探究FP类型,然后介绍一种将库与外部工具集成以进行精度调整的方法,并通过实验结果突出介绍引入该方法的明显好处。新格式。最后,我们提出了一种超精密FP单元的设计,该FP单元除了可以处理标准的32位操作之外,还可以处理8位和16位操作。 FP密集型基准测试的实验结果表明,高达90%的FP操作可以安全地缩减为8位或16位格式。得益于精确的调整和矢量化,平均执行时间减少了12%,内存访问平均减少了27%,从而使能耗降低了30%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号